ZooKeeper’s atomic broadcast protocol: Theory and practice
Abstract
Apache ZooKeeper is a distributed coordination service for cloud computing, providing essential synchronization and group services for other distributed applications. At its core lies an atomic broadcast protocol, which elects a leader, synchronizes the nodes, and performs broadcasts of updates from the leader. We study the design of this protocol, highlight promised properties, and analyze its official implementation by Apache. In particular, the default leader election protocol is studied in detail.
Similar resources
Consensus in a Box: Inexpensive Coordination in Hardware
Consensus mechanisms for ensuring consistency are some of the most expensive operations in managing large amounts of data. Often, there is a trade off that involves reducing the coordination overhead at the price of accepting possible data loss or inconsistencies. As the demand for more efficient data centers increases, it is important to provide better ways of ensuring consistency without affe...
Optimistic Asynchronous Atomic Broadcast
This paper presents a new protocol for atomic broadcast in an asynchronous network with a maximal number of Byzantine failures. It guarantees both safety and liveness without making any timing assumptions or using any type of “failure detector.” Under normal circumstances, the protocol runs in an “optimistic mode,” with extremely low message and computational complexity — essentially, just perf...
Tail Latency in ZooKeeper and a Simple Reimplementation
ZooKeeper [1] is a commonly used service for coordinating distributed applications. ZooKeeper uses leader-based atomic broadcast for writes, so that all state modifications are globally totally ordered, but it allows stale reads from any server for high read availability. This design trades high read throughput for potentially high write latency. Unfortunately, the extent of this tradeoff and t...
MajorCAN: A Modification to the Controller Area Network Protocol to Achieve Atomic Broadcast
It has already been reported that the CAN protocol produces data inconsistencies in some scenarios that violate the Atomic Broadcast properties. It has been proposed a set of higher level protocols to achieve Atomic Broadcast on CAN based systems. This approach causes considerable overhead. We propose a more efficient solution through small modifications to CAN. Our first proposal of modified p...
Fault-Tolerant Broadcasts in CAN
Fault-tolerant distributed systems based on field-buses may take advantage from reliable and atomic broadcast. There is a current belief that CAN native mechanisms provide atomic broadcast. In this paper, we dismiss this misconception, explaining how network errors may lead to: inconsistent message delivery; generation of message duplicates. These errors may occur when faults hit the last two bit...